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Abstract — This paper studies a spectrum sharing scenario 
between a cooperative relay network (CRN) and a nearby ad-hoc 
network. In particular, we consider a dynamic spectrum access 
and resource allocation problem of the CRN. Based on sensing 
and predicting the ad-hoc transmission behaviors, the ergodic 
traffic collision time between the CRN and ad-hoc network is 
minimized subject to an ergodic uplink throughput requirement 
for the CRN. 

We focus on real-time implementation of spectrum sharing 
policy under practical computation and signaling limitations. In 
our spectrum sharing policy, most computation tasks are ac- 
complished off-line. Hence, little real-time calculation is required 
which fits the requirement of practical applications. Moreover, 
the signaling procedure and computation process are designed 
carefully to reduce the time delay between spectrum sensing and 
data transmission, which is crucial for enhancing the accuracy of 
traffic prediction and improving the performance of interference 
mitigation. The benefits of spectrum sensing and cooperative 
relay techniques are demonstrated by our numerical experiments. 

Index Terms — Ad-hoc Network; Cooperative Relay Network; 
Spectrum Access; Traffic prediction; Resource Allocation; Real- 
time Implementation. 

I. Introduction 

In recent years, spectrum sharing between heterogeneous 
wireless networks has been studied intensively as a crucial 
technology for improving network spectrum efficiency [1] 
and network capacity [2]. Traffic prediction based spectrum 
access polices were proposed in [3]-[9], where the cogni- 
tive transmitter detects and predicts the primary user's (PU) 
transmission behaviors and transmits signals opportunistically 
to avoid collisions with the PU's traffic. Joint optimization 
of spectrum access and resource allocation based on traffic 
prediction has been proposed in [10] for an open sharing model 
[1] that considers spectrum sharing between an uplink system 
and an ad-hoc network. In [11], cooperative relay technique 
was utilized to improve the spectrum sharing performance. 

However, some implementation issues were rarely consid- 
ered in these studies. First, determining the resource allocation 
policy in real-time can be computationally quite demanding 
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for realistic wireless communication systems [12]. Second, 
spectrum sensing and channel estimation are usually per- 
formed at spatially separate nodes, which requires to exchange 
their obtained information before solve the resource allocation 
problem. The resultant signaling procedure and the computa- 
tion of resource allocation solution would cause a large time 
delay between spectrum sensing and data transmission, which 
would degrade the accuracy of traffic prediction and cause 
unexpected traffic collisions between the networks operating 
in the same spectrum. Therefore, resource allocation policies 
with little real-time calculation and small sensing-transmission 
delay are of great interest for practical applications. 

In this paper, we study spectrum sharing between a coopera- 
tive relay network (CRN) and an ad-hoc network, as illustrated 
in Fig. 1. The relay assists the transmissions from the mobile 
terminal (MT) to the base station (BS) to achieve higher uplink 
throughput. In order to communicate with the distant BS, the 
MT and relay would transmit signals with peak powers, which 
induce strong interference to nearby ad-hoc links. The ad- 
hoc transmitters (e.g., wireless sensor nodes) have relative low 
transmission powers due to their short communication ranges, 
and thus their interference to the relay and BS can be treated 
as noise. Such an asymmetrical interference scenario is known 
as the "near-far effect" [2]. 

We consider a joint spectrum access and resource allocation 
problem of the CRN, where the ergodic traffic collision time 
between the CRN and ad-hoc network is minimized subject 
to an ergodic uplink throughput constraint for the CRN. The 
formulated design problem is a difficult nonconvex optimiza- 
tion problem with no closed-form expression for the objective 
function. By carefully analyzing the problem structure, we 
show how this problem can be reformulated as a convex 
problem. A low-complexity Lagrangian optimization method 
is used to solve the considered design problem efficiently. 
Then, a real-time implementation policy is proposed which 
requires little real-time calculation and has small sensing- 
transmission delay. Finally, numerical results are provided to 
show the benefits of our spectrum sharing policy. 

II. System model 

The CRN operates in frames with duration Tf. Each frame 
comprises TV sub-channels in frequency domain, denoted by 
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Fig. 1: System setup of the spectrum sharing between coop- 
erative relay and ad-hoc networks. 

the set M = {1,2,- •• ,N}. We assume that the wireless 
channels of source-relay (S-R), source-destination (S-D), and 
relay-destination (R-D) links are block-faded, which vary 
across the frames in a stationary and ergodic manner. The 
channel gain normalized by the interference plus noise power 
of these links are denoted by g n r ,g n d ,g n d , respectively, for 
the n-th sub-channel. 

In practice, the relay node operates in a half-duplex mode. 
Therefore, each frame consists of 2 phases: In Phase 1, the 
source transmits signal to the relay and destination via a 
broadcast channel; in Phase 2, the source transmits a new 
information message, and, at the same time, the relay uses 
the DF relaying strategy to forward its received information 
message in Phase 1 to the destination, which forms a multiple- 
access channel. These operations are illustrated in Fig. 1. The 
time durations of Phase 1 and Phase 2 are set to aTf and 
(1 — a)Tf, respectively, where a G (0, 1). 

The ad-hoc links operate in M non-overlapping frequency 
bands denoted by the set M = {1, 2, • • • , M} and the ra- 
th ad-hoc band overlaps with a set of sub-channels given by 
Mn CAf = UliXn and Mm fWz = if m ^ I). The ad- 
hoc traffic in the ra-th band is modeled by a strictly stationary, 
ergodic and independent binary continuous-time Markov chain 
(CTMC) X m (t), where X m (t) = 1 (X m (t) = 0) represents an 
ACTIVE (IDLE) state at time t. The holding (or sojourn) peri- 
ods of ACTIVE and IDLE states are exponentially distributed 
with rate parameters A and /i, respectively. The probability 
transition matrix of the CTMC model of Band ra is given by 
[13, p. 391] 



P(t) = 
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where the element in the (i + l)-th row and (j + l)-th column 
of P(t) stands for the transition probability Pr{X m (t + r) = 
j\X m (r) = i} for ij G {0,1}. This CTMC model has 
been considered in many spectrum sharing studies including 
theoretical analysis and hardware tests; see [3]-[ll]. 

The source and relay detect the ACTIVE/IDLE state of each 
ad-hoc band at the start of both Phase 1 and Phase 2. The 
sensing outcome of the two phases are denoted by X m (Q) = 
x m G {0,1} and X m (aT f ) = y m G {0,1}, respectively. 
Perfect sensing and negligible sensing overhead are assumed 
in this paper. 

III. Problem formulation 

Let us define u = {g n r g n d : , g n d ,x m ,y m ,n G N, m G 
M} as the network state information (NSI). The dynamic 



transmission parameters are determined by the instant NSI 
u). Suppose that the source and relay nodes can switch on 
and off their transmissions freely over each sub-channel, and 
may transmit only in part of the time during Phase 1 and 
Phase 2. Let ln\uj) C [0, oiTf) denote the set of transmission 
time of the source over Sub-channel n in Phase 1, and 
I { n\u>) C [aT f ,T f ] denote that of the source and relay in 
Phase 2, for n = 1,...,7V. In\u) and In\u;) each may 
be a union of several disjoint transmission time intervals. We 
utilize the words "traffic collision" to represent the event that 
both the CRN and ad-hoc network are transmitting in the 
same spectrum band at the same time. In [14], we showed 
that ergodic traffic collision time between the two networks is 
given as 




rn=l 



Pr{X m (a) = l|X m (0) = x m } da 



Pr{X m (a) = l\X m (aT f ) = y m } da 



,(2) 



which is proportional to the transmission error probability of 
the ad-hoc network in strong interference scenarios [4]. 

Let tt(S) represents the size (measure) of set S; for example, 
7r([a, b]) = b — a. Thus, the transmission time fractions of 



the CRN are determined as K n \uo) = tt(K } (u))/T f and 

(2) (2) 

6n {w) = 7r(In /(u))Tf, respectively, for Phase 1 and Phase 
2 of the frame. Then, the ergodic achievable rate of the CRN 
can be expressed as [14] 
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Note that this ergodic rate can be achieved in slow-fading en- 
vironment by means of queuing at the relay node. Moreover, it 
is a concave function of {P$ (w), P& (w), P r ,n(w), 0^ (w), 
6n\u;), n G N}, since the perspective of a concave function 
is also concave [15, p. 89]. 

The joint spectrum access and resource allocation problem 
the CRN is formulated as 
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Fig. 2: Time-frequency transmission structure by Lemma 1. 

Pi^M,P^(«),Pr,n(«)>0 (8) 

4 1 >(aOC[0,aT / ], fHcK^T;] (9) 
tt(I«M) = 9P(u)T f , TrflgV)) = 6><?> (10) 

IV. The solution to the problem (P) 
Problem (P) is difficult to solve mainly because it is 
hard to determine the sets ln\uj) and ln\w) and thus the 
objective function / has no closed-form expression in general. 
Fortunately, these issues can be resolved and eventually (P) 
can be reformulated as a convex optimization problem, as we 
present in the following. 

A. Transformation of (P) to a convex problem 

In [14], we show that the optimal spectrum access should 
satisfy the following two principles: 

1) The source and relay nodes should transmit as soon (late) 
as possible if the sensing outcome is IDLE (ACTIVE); 

2) The CRN should have identical spectrum access policy 
for all sub-channels in M m ; that is, 1^ = 1^ for all 
p,qeJV m and i G {1,2}. 

Let us define 

0%(u) ^ max {9«(w),nG M m ) , (11) 

for m = 1, . . . , M and i = 1,2. The above principles are 
formalized in the following Lemma: 

Lemma 1 [14] For any given transmission time fractions 
{0 { m\u) G [0, a], 0^(0;) G [0,1 — a]}£f =1 , we have that: 

1) The optimal spectrum access policy of Phase 1 is 
given by $\u>) = [0,0&\u>)T f ] (^(w) = [(a - 
Om^ (oj))Tf, aTf]) for all n G M m , if the sensing outcome 
of Phase 1 is x m = (x m = 1); 

2 ) The optimal spectrum access policy of Phase 2 is given 
by ll 2) (u;) = [aT» + ^ } M)7/] (^(v) = [(1 - 
0rn(u>))Tf, Tf]) for all n G M m , if the sensing outcome 
of Phase 2 is y m = (y m = 1), 

An example of the spectrum access policy in Lemma 1 is 
shown in Fig. 2. According to Lemma 1, each term inside the 
expectation in (2) can be greatly simplified. For 6 G [0,a], 
define the functions 
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Pr(X m (t) = l|X m (aT f ) =0)dt 
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(14) 



Pr(X m (i) = l\X m (aT f ) = l)dt 

[Tf-eT f ,T f ] 

Iff i r -(X+Ml-a)T f \ e (\+p)eT f _ { 



(15) 



X+fj, \ (\+fi)T f 

It is easy to prove that the functions (6; x) are strictly 
convex in 6 by considering their secondary derivations. Then, 
the interference metric in (2) can be reformulated as 



( M 
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After some simple manipulations, the problem (P) can be 
reformulated as a convex optimization problem, i.e., 
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where i?i , i?2 are determined by 
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5. r/z^ optimal solution of (P) 

By solving the KKT conditions of the derived convex op- 
timization problem (17)-(22), we derived the optimal solution 
for each realization of the NSI u> and fixed dual variables [14]: 

The optimal value of the ratio Ps*n(u)/0rn is given by 

Ps { !n(u>) 



= positive root x of (26) if it exists, otherwise 0,(25) 



4 



and the root x is determined by 



the dual point v is given by 



°9n' 



Cmax{^ r ,^ d } 

i i r s.r s,d-\ -1 , s.c 
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em 2, 



(26) 



which is equivalent with a quadratic equation with closed-form 
solutions. 

(2) (2) 

The optimal values of the ratios Ps,n (<*>)/ 0m (<*>) and 

(2} 

P r ,n(v) /9m (v) are given by 
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with (•)+ = max(-,0), if P r5U (cc>) > is satisfied. Otherwise, 
if P r?n (u>) = 0, we obtain 



P$(u>) 

Pr,n(v) 



eln2 
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(29) 
(30) 



The optimal value of 0m\u;) is determined as 



(a;): 



(A+M)T/ 



ln<^ 1- 



A 



E 




0, 



, if x r . 



if ^777, 



(31) 



where the value of Pi,n(^)/^m^(^) is given by (25), /(#) = 
log 2 (1 + x)- (1+ ^ )ln2 , [x\l = min{max{x,0},?/},andln(x) 
is extended to take the value — oo for x G (— oo, 0] to simplify 
the formulations. The optimal value of 0$ (<jj) is given by 
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where the values of Ps,n (<*>)/ 0m (w) and P r?n (u>)/#V(<^) 
are given by (27)-(30). Substituting (31)-(32) into (25)-(30), 
the optimal values of Pj 5 n(u>), Pi 2 J(cj), P r5U (u>) are derived. 

We now optimize the dual variables v = {£, cr, £,v} T by 
the subgradient method [14], where the subgradient h{v) at 



h(u) = 
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where Pj.n^V), P$*(w) and P r * n (w) are derived throught 
(25)-(32) at the dual point is, and and P^ are the corre- 
sponding rate values in (23) and (24), respectively. 

C. Real-time implementations 

In the following, we show that dual variable v can 
be optimized off-line, which reduces the amount of real- 
time computations greatly. Moreover, by utilizing the struc- 
ture of the optimal solution (25)-(32), the primal solu- 
tions {P$(w),Pi^ can be 
updated on-line efficiently based on real-time NSI uj of each 
frame, while generating quite short sensing-transmission delay. 

1 ) Off-line dual optimization: These expectations (23), (24) 
and (33) do not have closed-form expressions. In practice, 
one can compute the subgradient h{y) by means of Monte 
Carlo simulations. Specifically, one may randomly generate a 
set of realizations of the NSI u> following the distributions 
of the CQIs and sensing outcomes. Then, the expectation 
terms in (23), (24) and (33) can be obtained by computing 
(25)-(32), (23) and (24) for each realization of oj, and then 
averaging the corresponding terms in (23), (24) and (33) over 
these realizations. By this, the subgradient updates with high 
computation burden can be performed off-line without using 
real-time NSI. 

2) On-line primal solution update: In practice, the BS 
(destination) acquires the CQI {<' r (/), 9% d (l), 0£ d (O>n=i 
of Frame I even before Frame I starts through prediction 
[16], if the wireless channel varies slowly across the frames. 

Therefore, the BS can compute the ratio — 



and 



according to (26)-(30) in Frame I — 1. While the 
sensing outcome x m (l) and y m {l) is still unknown at the BS 
at this moment, the BS can compute 0m\&i) and 0m\&i) 
in (31) and (32) by considering the two possible values of of 

Xm{l) and y m (l), respectively. Then, the BS sends 



and the possible vales of 0$ (u^) and 0$ (u>i) to the 

and the possible 



MT before Frame I starts, and sends Pr,ri( ^ 



(2) 

values of 9 m to the relay before Phase 2 of Frame I starts. 

After receiving the feedbacks from the destination, the MT 
performs spectrum sensing at the beginning of Phase 1, and 
then selects the value of 0$ '(u^) according to the sensing 
outcome x m (l). After Phase 1 of Frame /, the MT and relay 
node perform spectrum sensing again at the beginning of Phase 

(2) 

2, and then selects the value of m J (u>i) in accordance with the 
sensing outcomes y m (l). Therefore the MT and relay nodes 
can transmit information signals right after spectrum sensing 
with almost no sensing-transmission delay. 
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Fig. 3: The interference mitigation performance of different 
spectrum sharing policies. 



V. Numerical experiments 

We now compare our dynamic transmission policy with 2 
reference policies: 

1. The first one is a relay -free policy [10], where the source 
transmits signals directly to the destination without using the 
relay node. 

2. We then consider a time-hopping random access policy 
with no spectrum sensing [17], where the CRN's transmission 
time is chosen randomly in each frame like frequency hopping. 
In this policy, the transmission time of the CRN satisfies 
0$ (wi) = for n = 1, . . . , N, i = 1, 2 and all frame index 
/, and the transmission powers of the source and relay are 
allocated optimally to maximize Rdf- 

We consider that the source, relay and destination stands 
in a line and the relay locates in the middle of the source 
and destination. The CRN has N = 16 sub-channels, and 
the ad-hoc network has M = 4 bands. Thus, each ad-hoc 
band overlaps with 4 CRN sub-channels. The channel gain 
between every two nodes of the CRN at each sub-channel can 
be decomposed into a small-scale Rayleigh fading and a large- 
scaled path loss component with a path-loss factor of 4. The 
small-scale fading are i.i.d. across the sub-channels to simulate 
a frequency- selective environment. We assume that the power 
constraints of the source and relay nodes are the same and the 
signal-to-interference-plus-noise ratio (SINR) of the source- 
destination link is Pmax ^[^ - = 5dB. The parameters of ad- 
hoc traffic model satisfy fiTf = AT/ = 1. The value of a is 
chosen to be 0.5. 

Figure 3 illustrates the interference mitigation performance 
of the spectrum sharing policies. We find that our policy 
achieves better interference mitigation performance than the 
reference policies. More specifically, the relay-free policy is 
slightly worse than our policy in low spectrum efficiency 
region. However, if the required uplink spectrum efficiency is 
relative high, the interference mitigation performance of relay- 
free policy is quite poor, because of its relative low capacity. 
The spectrum efficiency of our policy is 80% higher than that 



of the relay-free policy, when the ergodic traffic collision time 
per second is larger than 0.01s. The time-hopping policy has 
quite poor performance for relative low spectrum efficiency, 
because it has not utilized the spectrum sensing results. 

VI. Conclusions 

This paper studied a spectrum sharing scenario between co- 
operative relay and ad-hoc networks. A dynamic transmission 
policy of the CRN is proposed which requires little real-time 
computation and guarantees high traffic prediction accuracy. 
The benefits of spectrum sensing and cooperative relay tech- 
niques are demonstrated by our numerical experiments. 
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